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1  Work  Performed  within  This  Reporting  Period 

In  this  reporting  period,  we  performed  the  following  tasks. 

•  Enhanced  the  capabilities  for  top  K  user  and  hashtag  subgraph  querying  and 
visualization.  We  have  enhanced  subgraph  matching  queries  to  retrieve  top  K 
users/hashtags  and  their  neighborhoods,  and  further  improved  Scraawl  graph 
visualization  to  visualize  the  results. 

•  Enhanced  the  capabilities  for  friendship  graph  querying  and  visualization.  We 

have  implemented  a  service  to  retrieve  friends  and  followers  of  an  arbitrary  set  of 
users,  and  further  improved  Scraawl  graph  visualization  to  visualize  the  results. 

1.1  Top  K  User  and  Hashtag  Subgraph  Querying  and  Visualization 

1.1.1  Top  K  Query  Implementation 

As  reported  in  Progress  Report  No.  1,  we  consider  the  Twitter  interaction  graph  modeled 
as  an  undirected  graph  G  —  (V,  E),  where  V  is  the  set  of  nodes  (vertices)  and  E  is  the  set 
of  edges.  For  a  collection  of  tweets  (t(0)  |  6  G  Z+],  where  each  tweet  r(. )  can  be  uniquely 
identified  by  its  unique  tweet  ID  9  G  7L+ ,  let  r (0)  be  tweeted  by  user  uT(0)  and  let  uT^ 

have  retweeted,  mentioned,  or  replied  to  Kg  users  j(r(0))  =  v^gy  ... ,  Of 

course,  if  no  retweets,  mentions,  or  replies  are  present,  then  j(r(0))  =  0.  We  can  then 
unambiguously  specify  the  Twitter  Interaction  Graph  G  with  V  —  (itT(g)  U  J(t(0))|  9  G 
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TL+]  and  E  =  {ur(g)  x  J(t(0))|  9  E  Z+}  =  {(ur(g)l  v}(g)), 

(uT(g),  v?(g)), ... ,  (uT(g),  v?(g))\9  E  T+ }. 


Our  interest  lies  in  the  querying  of  this  graph.  We  first  define  a  generic  query  operator  over 
the  graph  that  will  return  a  subset  of  the  graph,  i.e.,  Q(.)\G  ■-»  G.  In  this  reporting  period, 
we  have  designed  and  implemented  topi  K  neighborhood  queries.  Formally,  we  define  the 
top  K  neigborhood  query  as 

Q  —  G(V',E ')  where  V'  —  vt  U  IV (1^)  ^  and  s.  t.  E'  —  ^J(v£,  V/(i)) , 

where  Vj(i )  G  N(yi) 

for  a  set  of  “top”  Ui^t  vertices  and  N (vj  denotes  the  immediate  neighborhood  of  V[. 
Recall  from  Report  1  that  we  have  designed  and  implemented  a  graph  querying  system 
using  the  graph  database  OrientDB.  We  have  added  the  above  querying  capability 
implemented  using  OrientDB  and  made  it  available  through  Scraawl. 

1.1.2  Top  K  query 

In  particular,  we  have  implemented  top  K  neighborhood  queries  for  the  following  top  K. 
Top  K  Users:  The  top  tweeting  users. 

Top  K  Hashtags:  The  top  tweeted  hashtags. 

Top  Connected  Users:  Top  users  ranked  by  degree  in  the  Social  Graph. 

Top  Connected  Hashtags:  Top  hashtags  ranked  by  degree  in  the  Social  Graph. 

We  have  also  enhanced  the  visualization  for  top  K  queries.  We  have  improved  the 
visualization  by  (i)  automatically  adjusting  zooming  and  translating  so  that  the  graph  fits 
into  the  visible  screen,  (ii)  incorporating  web  workers,  i.e.,  background  computation 
threads,  so  that  graph  layout  is  computed  without  interfering  with  user’s  interaction  with 


i  As  defined  according  to  context  as  shown  in  Section  1.1.2. 
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the  page,  and  (iii)  adjusted  the  parameters  of  the  graph  layout  so  that  there  is  less  cluttering. 
Figure  1  is  a  representative  visualization  for  a  top  10  query  as  integrated  into  Scraawl. 


1.1.3  Graph  Visualization  for  Friendship  Graph 

We  have  also  designed  and  implemented  a  service  that  retrieves  the  friends  and  followers 
of  a  given  user  and  visualizes  it  in  Scraawl.  Figure  2  is  a  representative  graph  visualization 
of  the  friendship  graph  of  two  users.  The  “queried  users”  are  depicted  as  orange,  the 
common  friends/followers  are  shown  as  green,  and  the  rest  of  the  users  are  drawn  in  gray. 
Unlike  the  Social  graph,  the  friendship  graph  is  directed  hence  the  direction  of  links  are 
represented  in  the  visualization.  The  relationship  “being  a  friend”  and  “being  a  follower” 
are  distinguished  by  using  orange  and  gray  colors. 
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2  Current  Problems 

None. 

3  Work  to  be  Performed  in  the  Next  Reporting  Period 

In  the  next  report  period,  we  will  focus  on  the  following  tasks: 

•  We  will  start  executing  Task  2. 

•  We  will  deliver  Scraawl  1.15.0. 
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